Abstract 

We present a new approach to the problem of enumerating permutations of length n that avoid 
a fixed consecutive pattern of length m. We use this idea to give explicit upper and lower bounds 
on the number of permutations avoiding a pattern of length m. As a corollary, we obtain a simple 
proof of the CMP conjecture [7], regarding the most avoided pattern, recently proved by Elizalde [6]. 
Finally, we also show that most patterns behave similarly to the least avoided one. 
1 Introduction 

Let $S_n$ be the symmetric group of permutations of length $n$. Consider a permutation $\pi = (\pi_1, \dots, \pi_n) \in S_n$ and a pattern $\sigma = (\sigma_1, \dots, \sigma_m) \in S_m$. Henceforth $m$ will be a fixed integer while $n$ will be considered to tend to infinity. For any sequence of distinct positive integers $X = (x_1, \dots, x_k)$, we define the standardization of $X$, $\mathrm{st}(x_1, \dots, x_k)$, as the permutation in $S_k$ obtained by relabeling the elements of $X$ with $\{1, \dots, k\}$ so that the relative order among the elements of $X$ is preserved. 

A permutation $\pi \in S_n$ contains $\sigma$ as a consecutive pattern if there exists $0 \le i \le n - m$ such that $\mathrm{st}(\pi_{i+1}, \dots, \pi_{i+m}) = \sigma$, that is, there are $m$ consecutive elements in $\pi$ that have the relative order prescribed by $\sigma$. For instance, if $\sigma = (12 \dots m)$, $\pi$ contains $\sigma$ as a consecutive pattern if and only if it contains $m$ consecutive increasing elements (a run of length $m$). A permutation $\pi \in S_n$ is called $\sigma$-avoiding if it does not contain $\sigma$ as a consecutive pattern. Denote by $\alpha_n(\sigma)$ the number of permutations in $S_n$ that are $\sigma$-avoiding. 
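
The definitions above can be made concrete with a small brute-force sketch. The function names `standardize`, `contains_consecutive` and `alpha` are illustrative choices, not notation from the paper, and the enumeration over all of $S_n$ is only feasible for small $n$.

```python
from itertools import permutations

def standardize(seq):
    """Return st(seq): relabel distinct values with 1..k, keeping their relative order."""
    rank = {v: r for r, v in enumerate(sorted(seq), start=1)}
    return tuple(rank[v] for v in seq)

def contains_consecutive(pi, sigma):
    """True if some window of len(sigma) consecutive entries of pi standardizes to sigma."""
    m = len(sigma)
    return any(standardize(pi[i:i + m]) == tuple(sigma)
               for i in range(len(pi) - m + 1))

def alpha(n, sigma):
    """Number of sigma-avoiding permutations of length n (brute force over S_n)."""
    return sum(1 for pi in permutations(range(1, n + 1))
               if not contains_consecutive(pi, sigma))
```

For example, `alpha(4, (1, 2, 3))` counts the permutations of length 4 with no three consecutive increasing entries: 7 of the 24 permutations contain such a run, so 17 avoid it.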
The problem of determining $\alpha_n(\sigma)$ is inspired by the problem of finding the number of permutations of length $n$ that avoid a pattern $\sigma$ not necessarily in consecutive positions. A permutation $\pi \in S_n$ contains $\sigma$ if there exist $1 \le i_1 < \dots < i_m \le n$ such that $\mathrm{st}(\pi_{i_1}, \dots, \pi_{i_m}) = \sigma$. Clearly, if $\pi$ avoids $\sigma$, then $\pi$ also avoids $\sigma$ as a consecutive pattern. Knuth [11] introduced the non-consecutive problem and determined exactly the number of permutations avoiding a given pattern of length 3. There are many interesting results in the area (see e.g. [3, 1]) as well as the famous Stanley-Wilf conjecture, which was proved by Marcus and Tardos [12]. 

Providing an exact formula for $\alpha_n(\sigma)$ is a hard problem when $n$ becomes large. However, asymptotic formulas can be derived, as shown by Elizalde and Noy in [7]. They provide an estimate of $\alpha_n(\sigma)$ for any pattern $\sigma$ of length 3 and also for some patterns of length 4. An asymptotic formula for $\alpha_n(\sigma)$ is still not known for all the patterns of length 4. 

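Elizalde in [5] showed that for any $\sigma \in S_m$, the following limit exists, 

$$\rho_\sigma = \lim_{n \to \infty} \left( \frac{\alpha_n(\sigma)}{n!} \right)^{1/n},$$

and that $0.7839 < \rho_\sigma < 1$. In particular, it is known that $\alpha_n(\sigma) \sim c\,\rho_\sigma^n\, n!$, for some constant $c$ that depends on $\sigma$. 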

Whereas it is very hard to compute $\rho_\sigma$ exactly for an arbitrary $\sigma \in S_m$, it is possible to provide upper and lower bounds in terms of $m$. Besides, it is interesting to see which patterns are extremal in that sense. A pattern of length $m$ is called monotone if it is either $(12 \dots m)$ or $(m \dots 21)$. It is clear that $\alpha_n(12 \dots m) = \alpha_n(m \dots 21)$, since $\pi \in S_n$ is $(12 \dots m)$-avoiding if and only if its reversal $(\pi_n, \dots, \pi_1)$ is $(m \dots 21)$-avoiding. In [7] it was conjectured that monotone patterns are the most avoided ones among all patterns of length $m$, when $n$ is large enough. This is known as the Consecutive Monotone Pattern (CMP) Conjecture. 

Conjecture 1 (CMP conjecture [7]). For any $\sigma \in S_m$, 

$$\rho_\sigma \le \rho_{(12 \dots m)}.$$

The results in [7], which determine $\rho_\sigma$ for any $\sigma \in S_3$, settle the CMP conjecture in the affirmative for patterns of length 3. Elizalde and Noy [8] show that the conjecture is true for the class of non-overlapping patterns. They also study the monotone pattern and provide the exact value of $\rho_{(12 \dots m)}$ implicitly, through the smallest root of a formal power series. 

Regarding the least avoided pattern among all the patterns of length $m$, Nakamura [13] posed the following conjecture. 

Conjecture 2 ([13]). For any $\sigma \in S_m$, 

$$\rho_\sigma \ge \rho_{(12 \dots m-2,\, m,\, m-1)}.$$

Both conjectures have been recently proved by Elizalde in [6]. The proofs are based on computing the generating function for the number of $\sigma$-avoiding permutations, $P_\sigma(z) = \sum_{n \ge 0} \alpha_n(\sigma) \frac{z^n}{n!}$, combined with the cluster method of Goulden and Jackson [9]. 

Here we will use a completely different approach to the consecutive pattern avoidance problem, through the so-called probabilistic method. While this approach is not as precise as the generating function technique, it is simpler. It provides more direct proofs of some existing results, such as the CMP conjecture, and it also allows us to go further in some directions, as will be seen in Section 5. 

Our first result bounds $\rho_\sigma$ from above when the pattern $\sigma$ is not monotone. 
Theorem 3. For any $\sigma \in S_m \setminus \{(12 \dots m), (m \dots 21)\}$, 

$$\rho_\sigma \le 1 - \frac{1}{m!} + O\left( \frac{1}{m^2\, m!} \right).$$

The proofs of this and all the following results have a probabilistic flavor. We set out the problem through the Poisson Paradigm (see e.g. [2]), which asserts that, in a probability space, events that are nearly independent should behave almost as if they were independent. To prove this theorem we make use of Suen's inequality [15], a powerful tool that provides an upper bound on the probability that none of the events in a given collection occurs. 

Theorem 3 can be extended to the whole set of patterns, $S_m$, by weakening the upper bound: for any $\sigma \in S_m$, 

$$\rho_\sigma \le 1 - \frac{1}{m!} + O\left( \frac{1}{m \cdot m!} \right).$$

However, this bound is not strong enough to prove the CMP conjecture. From the results given in [8], one can derive a lower bound on $\rho_{(12 \dots m)}$ to show that the CMP conjecture holds for any large enough $m$. A more careful analysis of the constants hidden inside the asymptotic notation shows that it is enough to consider $m \ge 5$. 

The second part of the article is devoted to giving a general lower bound on $\rho_\sigma$ when $\sigma \in S_m$. 
Theorem 4. For any $\sigma \in S_m$, 

$$\rho_\sigma \ge 1 - \frac{1}{m!} - O\left( \frac{m-1}{(m!)^2} \right).$$

To prove this lower bound we use a one-sided version of the Lovász Local Lemma (see [14]). This bound is asymptotically tight and an extremal example is provided by the pattern $(12 \dots m-2,\, m,\, m-1)$. Unlike in the case of the upper bound and the CMP conjecture, the proof of Theorem 4 cannot be adapted to extract a proof of Conjecture 2. 

As Theorem 3 and Theorem 4 give bounds for the value of $\rho_\sigma$ in terms of $m$, a natural question is to determine how most patterns behave. In this direction a much stronger upper bound, close to the general lower bound, is shown to hold for most patterns. 
Theorem 5. Let $\sigma \in S_m$ be chosen uniformly at random. Then, for any $2 \le k \le m/2$, 

$$\rho_\sigma \le 1 - \frac{1}{m!} + O\left( \frac{1}{(m-k)!\, m!} \right)$$

with probability at least 

$$1 - \frac{2}{(k+1)!} - m 2^{-m/2}.$$

This theorem shows that most patterns behave similarly to the least avoided one. The idea behind this result is that the number of permutations avoiding a pattern depends on the maximum overlapping position of this pattern. It can be shown that almost all patterns do not have a large overlap and thus they are far from the upper bound attained by monotone patterns, the ones with maximum overlap. 

This paper is organized as follows. In Section 2, Theorem 3 is proven. A lower bound on $\rho_{(12 \dots m)}$ is derived in Section 3, completing the proof of the CMP conjecture. Section 4 is devoted to the proof of Theorem 4. Finally, in Section 5 we provide the proof of Theorem 5. 

2 An upper bound on $\rho_\sigma$ 

Consider a set of events $\mathcal{A} = \{A_1, \dots, A_N\}$ with associated indicator random variables $X_1, \dots, X_N$ and let $X = \sum_{i=1}^{N} X_i$. In general, the events in $\mathcal{A}$ will be considered to be bad and the aim is to bound, either from above or from below, the probability that none of these bad events occurs. We will denote by $\mu$ the expected number of bad events, that is, $\mu = \mathbb{E}(X) = \sum_{i=1}^{N} \Pr(A_i)$. 

A dependency graph of $\mathcal{A}$ is a graph $H$ with vertex set $V(H) = \{1, \dots, N\}$ such that, whenever two disjoint subsets $S, T \subseteq [N]$ share no edges, the families $\{A_i\}_{i \in S}$ and $\{A_j\}_{j \in T}$ are independent. 

Two parameters are defined to control the dependencies among all events. To measure the global effect of the dependencies, consider 

$$\Delta = \sum_{ij \in E(H)} \Pr(A_i \cap A_j),$$

and for the local one, 

$$\delta = \max_{1 \le i \le N} \sum_{j :\, ij \in E(H)} \Pr(A_j),$$

where $E(H)$ denotes the edge set of $H$. 

We will use the following version of Suen's inequality (see e.g. Theorem 2 in [10]). 

Theorem 6 (Suen's inequality). With the above notation, 

$$\Pr(X = 0) \le \exp\left( -\mu + \Delta e^{2\delta} \right). \qquad (1)$$
Suen's inequality bounds from above the probability of having no bad events in terms of the expected 
number of events, but also takes into account the pairwise dependence of the events. Thus, if the 
dependencies among the events are weak or unlikely, we will be able to give a meaningful upper bound 
on such probability. 

Let $\pi \in S_n$ be chosen uniformly at random, and let $\sigma \in S_m$ be a fixed pattern. For any $0 \le i \le n - m$ we define the event $A_i := \{\mathrm{st}(\pi_{i+1}, \dots, \pi_{i+m}) = \sigma\}$. Then $\pi$ avoids $\sigma$ as a consecutive pattern if and only if $X = 0$, that is, no copy of the pattern $\sigma$ appears. By computing the probability of this event, 

$$\alpha_n(\sigma) = \Pr(X = 0)\, n!,$$

where $\Pr(X = 0)$ depends on $\sigma$. In particular we will be interested in 

$$\rho_\sigma = \lim_{n \to \infty} \Pr(X = 0)^{1/n}. \qquad (2)$$

Bounding from above the number of edges in a dependency graph $H$ is crucial in order to give a good upper bound on the probability that none of the events occurs. The following lemma shows that there are many pairs of sets of events that share no edges. 

Lemma 7. Let $S, T \subseteq \{0, 1, \dots, n - m\}$ be two disjoint subsets of indices such that for any $i \in S$ and any $j \in T$, we have $|i - j| \ge m$. Then, the events $\{A_i\}_{i \in S}$ and $\{A_j\}_{j \in T}$ are independent. 

Proof. For any two disjoint sets $U^+, U^- \subseteq \{0, 1, \dots, n - m\}$, define the event 

$$A_{U^+, U^-} := \Big( \bigcap_{i \in U^+} A_i \Big) \cap \Big( \bigcap_{i \in U^-} \overline{A_i} \Big).$$

It suffices to show that for any two disjoint subsets $S^+, S^- \subseteq S$ and $T^+, T^- \subseteq T$, 

$$\Pr\left( A_{S^+, S^-} \mid A_{T^+, T^-} \right) = \Pr\left( A_{S^+, S^-} \right). \qquad (3)$$

We say that $j \in \{0, 1, \dots, n - 1\}$ belongs to the support of $S$, $\mathrm{supp}(S)$, if there exists $i \in S$ such that $0 \le j - i \le m - 1$. Observe that $\mathrm{supp}(S) \cap \mathrm{supp}(T) = \emptyset$, by the assumptions on $S$ and $T$. Clearly, the event $A_{S^+, S^-}$ is determined by the elements appearing in the positions indexed by $\mathrm{supp}(S)$. 

Denote by $\mathcal{T} \subseteq S_n$ the subset of permutations of length $n$ that satisfy $A_{T^+, T^-}$. Choose $\tau \in \mathcal{T}$ uniformly at random. It is enough to consider $\tau$ restricted to $\mathrm{supp}(S)$, denoted $\tau'$, and show that its standardization, $\mathrm{st}(\tau')$, is uniformly distributed in $S_{|\mathrm{supp}(S)|}$. 

The key observation is that $A_{T^+, T^-}$ might condition which elements lie in $\mathrm{supp}(S)$ but does not impose anything on their order. Therefore, the elements appearing in $\tau'$ may be conditioned by $A_{T^+, T^-}$, but $\mathrm{st}(\tau')$ is not affected by $A_{T^+, T^-}$. Since $A_{S^+, S^-}$ is satisfied in $\tau'$ if and only if it is satisfied in $\mathrm{st}(\tau')$ (with the corresponding relabeling), equation (3) holds. □ 
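
The independence asserted by the lemma can be checked exactly on a small instance. The sketch below is an illustration only; the function name `joint_and_product` is an assumption of this example. It enumerates $S_n$ and compares $\Pr(A_i \cap A_j)$ with $\Pr(A_i)\Pr(A_j)$ using exact rational arithmetic.

```python
from fractions import Fraction
from itertools import permutations

def st(seq):
    """Standardization of a sequence of distinct integers."""
    ordered = sorted(seq)
    return tuple(ordered.index(v) + 1 for v in seq)

def joint_and_product(n, sigma, i, j):
    """Return (Pr(A_i and A_j), Pr(A_i) * Pr(A_j)) for a uniform permutation of length n."""
    m = len(sigma)
    count_i = count_j = count_both = total = 0
    for p in permutations(range(1, n + 1)):
        total += 1
        hit_i = st(p[i:i + m]) == sigma
        hit_j = st(p[j:j + m]) == sigma
        count_i += hit_i
        count_j += hit_j
        count_both += hit_i and hit_j
    return (Fraction(count_both, total),
            Fraction(count_i, total) * Fraction(count_j, total))
```

With $n = 6$, $m = 3$ and windows starting at $i = 0$ and $j = 3$ (so $|i - j| \ge m$), the two returned values coincide. For overlapping windows such as $i = 0$, $j = 1$ with $\sigma = (1, 2, 3)$ they differ, which is why those pairs are joined by an edge in the dependency graph.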
The previous lemma suggests that a good dependency graph for the set of events $\mathcal{A}$ is the circulant graph $H$ with vertex set $V(H) = \{0, 1, \dots, n - m\}$, where $ij \in E(H)$ if and only if $0 < |i - j| < m$. Throughout the paper, we will use this circulant graph as a dependency graph of $\mathcal{A}$. 
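
To see how $\mu$, $\Delta$ and $\delta$ interact for this dependency graph, the following sketch computes them by exhaustive enumeration for small $n$ and compares the exact value of $\Pr(X = 0)$ with the right-hand side of Suen's inequality (1). The function name is hypothetical and the enumeration is a sanity check under the definitions above, not part of the proof.

```python
from itertools import permutations
from math import exp

def st(seq):
    """Standardization of a sequence of distinct integers."""
    ordered = sorted(seq)
    return tuple(ordered.index(v) + 1 for v in seq)

def suen_bound_vs_exact(n, sigma):
    """Return (Pr(X = 0), exp(-mu + Delta * e^(2 delta))) for the events A_0..A_{n-m}."""
    m = len(sigma)
    starts = range(n - m + 1)
    single = [0] * len(starts)   # single[i] counts permutations where A_i holds
    pair_hits = 0                # counts (permutation, edge ij) with A_i and A_j both holding
    avoiding = 0
    total = 0
    for p in permutations(range(1, n + 1)):
        total += 1
        hits = [i for i in starts if st(p[i:i + m]) == sigma]
        if not hits:
            avoiding += 1
        for i in hits:
            single[i] += 1
        # each unordered edge {a, b} with 0 < b - a < m is counted once
        pair_hits += sum(1 for a in hits for b in hits if 0 < b - a < m)
    mu = sum(single) / total
    Delta = pair_hits / total
    delta = max(sum(single[j] for j in starts if 0 < abs(i - j) < m)
                for i in starts) / total
    return avoiding / total, exp(-mu + Delta * exp(2 * delta))
```

For instance, `suen_bound_vs_exact(6, (1, 3, 2))` returns the exact avoidance probability together with the Suen bound. The bound is noticeably smaller for a pattern such as $(1, 3, 2)$, whose only overlap is at $k = 1$, than for the monotone pattern, which is the effect exploited in Theorem 3.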

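A simple upper bound follows directly from the previous observation. Consider $I = \{km : 0 \le k \le n/m - 1\}$, then 

$$\Pr(X = 0) = \Pr\Big( \bigcap_{i=0}^{n-m} \overline{A_i} \Big) \le \Pr\Big( \bigcap_{i \in I} \overline{A_i} \Big) = \prod_{i \in I} \Big( 1 - \Pr\Big( A_i \,\Big|\, \bigcap_{j \in I,\, j < i} \overline{A_j} \Big) \Big).$$

By using Lemma 7 with $S = \{i\}$ and $T = \{j : j \in I, j < i\}$, 

$$1 - \Pr\Big( A_i \,\Big|\, \bigcap_{j \in I,\, j < i} \overline{A_j} \Big) = 1 - \Pr(A_i) = 1 - \frac{1}{m!}.$$

Since $|I| = \lfloor n/m \rfloor$, this implies 

$$\rho_\sigma \le \left( 1 - \frac{1}{m!} \right)^{1/m} = 1 - \Theta\left( \frac{1}{m \cdot m!} \right).$$

However, a better bound is given in Theorem 3 by taking into account the interaction between pairs of dependent events. 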

A pattern $\sigma \in S_m$ has an overlap at $k$ if $\mathrm{st}(\sigma_1, \dots, \sigma_k) = \mathrm{st}(\sigma_{m-k+1}, \dots, \sigma_m)$, that is, the first and the last $k$ positions have the same relative order. If a pattern does not have an overlap at $k$, then 

$$\Pr(A_i \cap A_{i+m-k}) = 0. \qquad (4)$$

For any $1 \le k \le m - 1$, define the set $\mathcal{M}_k \subseteq S_m$ as the set of patterns of length $m$ that have no overlap larger than $k$. The elements of $\mathcal{M}_1$ are called non-overlapping patterns. They have been enumerated in [4] and also extensively studied in [6]. 

Observe that $\mathcal{M}_{m-1}$ is the whole set of patterns of length $m$. One of the crucial facts needed to prove Theorem 3 is that $\mathcal{M}_{m-1} \setminus \mathcal{M}_{m-2}$, the set of patterns that have an overlap at $m - 1$, consists only of the monotone patterns. 

Lemma 8. For any $m \ge 3$, 

$$\mathcal{M}_{m-1} \setminus \mathcal{M}_{m-2} = \{(12 \dots m), (m \dots 21)\}.$$

Proof. It is clear that both monotone patterns belong to $\mathcal{M}_{m-1} \setminus \mathcal{M}_{m-2}$. Let us show that any other $\sigma \in S_m \setminus \{(12 \dots m), (m \dots 21)\}$ does not. Suppose that $\sigma \in \mathcal{M}_{m-1} \setminus \mathcal{M}_{m-2}$. This implies that 

$$\mathrm{st}(\sigma_1 \dots \sigma_{m-1}) = \mathrm{st}(\sigma_2 \dots \sigma_m). \qquad (5)$$

Since $\sigma$ is not a monotone pattern, there exists an index $2 \le i \le m - 1$ such that $\sigma_{i-1} > \sigma_i < \sigma_{i+1}$ or $\sigma_{i-1} < \sigma_i > \sigma_{i+1}$. Without loss of generality we assume the latter. Now observe that (5) implies that if $\sigma_{i-1} < \sigma_i$, then $\sigma_i < \sigma_{i+1}$, leading to a contradiction. □ 

Thus, the maximum overlap of a pattern $\sigma \in S_m \setminus \{(12 \dots m), (m \dots 21)\}$ is at most $m - 2$. This cannot be improved since there are non-monotone patterns that have an overlap at $m - 2$. For instance, consider $m = 2t$ and $\sigma = (1, t+1, 2, t+2, \dots, t, 2t)$. 
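
The notion of overlap and the statement of Lemma 8 are easy to check by computer for small $m$. The following sketch uses illustrative function names that do not appear in the paper. It tests the overlap condition directly from the definition and lists the patterns with an overlap at $m - 1$.

```python
from itertools import permutations

def st(seq):
    """Standardization of a sequence of distinct integers."""
    ordered = sorted(seq)
    return tuple(ordered.index(v) + 1 for v in seq)

def has_overlap(sigma, k):
    """True if the first k and the last k entries of sigma have the same relative order."""
    return st(sigma[:k]) == st(sigma[len(sigma) - k:])

def max_overlap(sigma):
    """Largest k in 1..m-1 at which sigma overlaps (k = 1 always qualifies)."""
    return max(k for k in range(1, len(sigma)) if has_overlap(sigma, k))

def patterns_overlapping_at(m, k):
    """All patterns of length m with an overlap at k, in lexicographic order."""
    return [s for s in permutations(range(1, m + 1)) if has_overlap(s, k)]
```

For $m = 4$ and $m = 5$, `patterns_overlapping_at(m, m - 1)` returns only the two monotone patterns, in agreement with Lemma 8, while the interleaved pattern $(1, 4, 2, 5, 3, 6)$ (the case $t = 3$ of the example above) has `max_overlap` equal to $4 = m - 2$.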

The following lemma gives some insight into the structure of the permutations that contain two given occurrences of a pattern $\sigma$. 

Lemma 9. Let $\sigma \in S_m$ be a pattern with an overlap at $k$ and suppose that $\tau \in S_{2m-k}$ is such that the events $A_0$ and $A_{m-k}$ hold. If $\sigma' = \mathrm{st}(\sigma_{m-k+1}, \dots, \sigma_m)$, then, for any $0 \le i < k$, we have 

$$\tau_{m-i} = \sigma_{k-i} + \sigma_{m-i} - \sigma'_{k-i}.$$

Proof. Fix some $i < k$. By the event $A_0$, we know that $\tau_{m-i}$ should be larger than $\sigma_{m-i} - 1$ elements and smaller than $m - \sigma_{m-i}$ elements from $(\tau_1, \dots, \tau_{m-i-1}, \tau_{m-i+1}, \dots, \tau_m)$. By the event $A_{m-k}$, it is also true that $\tau_{m-i}$ is larger than $\sigma_{k-i} - 1$ and smaller than $m - \sigma_{k-i}$ elements from 

$$(\tau_{m-k+1}, \dots, \tau_{m-i-1}, \tau_{m-i+1}, \dots, \tau_{2m-k}).$$

Consider now the permutation $\sigma' = \mathrm{st}(\sigma_{m-k+1}, \dots, \sigma_m) \in S_k$. Then there are $\sigma'_{k-i} - 1$ elements that are counted twice when we look at the elements smaller than $\tau_{m-i}$ in the two windows, and $k - \sigma'_{k-i}$ that are also double counted when we look at the larger ones. Therefore 

$$\tau_{m-i} > \sigma_{k-i} + \sigma_{m-i} - 2 - (\sigma'_{k-i} - 1),$$

and 

$$\tau_{m-i} \le 2m - k - \left( m - \sigma_{k-i} + m - \sigma_{m-i} - (k - \sigma'_{k-i}) \right).$$

Observing that the first inequality is strict, 

$$\tau_{m-i} = \sigma_{k-i} + \sigma_{m-i} - \sigma'_{k-i}.$$

□ 

Using this last lemma, we can provide an upper bound on the probability that two given occurrences of a pattern appear. 

Lemma 10. For any $\sigma \in S_m$ and any $1 \le k \le m - 1$, 

$$\Pr(A_i \cap A_{i+m-k}) \le \frac{4^{m-k}}{\sqrt{\pi (m-k)} \cdot (2m-k)!}.$$

Proof. If $\sigma$ does not have an overlap at $k$, $\Pr(A_i \cap A_{i+m-k}) = 0$ and we are done. Thus, assume that $\sigma$ has an overlap at $k$. 

Set $\tau = \mathrm{st}(\pi_{i+1}, \dots, \pi_{i+2m-k})$. Recall that $\pi \in S_n$ has been chosen uniformly at random, which implies that $\tau$ is uniformly distributed in $S_{2m-k}$. Moreover, $\pi$ satisfies $A_i$ and $A_{i+m-k}$ if and only if $\tau$ satisfies $A_0$ and $A_{m-k}$. 

There are $(2m-k)!$ possible candidates for $\tau$. We will count how many of them are such that the events $A_0$ and $A_{m-k}$ hold. By Lemma 9, we know that the elements $(\tau_{m-k+1}, \dots, \tau_m)$ are uniquely determined by $\sigma$ and $k$. Thus, one must select a subset of $m - k$ elements among the $2m - 2k$ available ones to construct $(\tau_1, \dots, \tau_{m-k})$. Since $\tau$ satisfies $A_0$, once these elements have been chosen there is just one order such that $\mathrm{st}(\tau_1, \dots, \tau_m) = \sigma$, and only one way to set the last $m - k$ elements of $\tau$ in order to satisfy $A_{m-k}$. 

Hence, for $\pi \in S_n$, 

$$\Pr(A_i \cap A_{i+m-k}) \le \frac{\binom{2(m-k)}{m-k}}{(2m-k)!} \le \frac{4^{m-k}}{\sqrt{\pi(m-k)} \cdot (2m-k)!},$$

where we have used that $\binom{2\ell}{\ell} \le \frac{4^\ell}{\sqrt{\pi \ell}}$. One can prove this last inequality by using Stirling's approximation. □ 
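
The counting argument in this proof can be tested exhaustively for small $m$. The sketch below is illustrative and its function names are assumptions of this example. It counts the permutations $\tau \in S_{2m-k}$ satisfying $A_0$ and $A_{m-k}$, and checks the count against both $\binom{2(m-k)}{m-k}$ and the final bound of the lemma.

```python
from itertools import permutations
from math import comb, factorial, pi, sqrt

def st(seq):
    """Standardization of a sequence of distinct integers."""
    ordered = sorted(seq)
    return tuple(ordered.index(v) + 1 for v in seq)

def joint_count(sigma, k):
    """Number of tau in S_{2m-k} whose first m and last m entries both standardize to sigma."""
    m = len(sigma)
    length = 2 * m - k
    return sum(1 for t in permutations(range(1, length + 1))
               if st(t[:m]) == sigma and st(t[m - k:]) == sigma)

def lemma10_holds(m):
    """Check the bound of Lemma 10 for every pattern of length m and every 1 <= k <= m-1."""
    for sigma in permutations(range(1, m + 1)):
        for k in range(1, m):
            count = joint_count(sigma, k)
            if count > comb(2 * (m - k), m - k):
                return False
            bound = 4 ** (m - k) / (sqrt(pi * (m - k)) * factorial(2 * m - k))
            if count / factorial(2 * m - k) > bound:
                return False
    return True
```

For $\sigma = (1, 3, 2)$ and $k = 1$ the count is 3: in $S_5$ one must have $\tau_1 = 1$ and $\tau_3 = 2$, and the remaining freedom is the choice of $\tau_2$ among $\{3, 4, 5\}$. This is below $\binom{4}{2} = 6$.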



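Now we are able to prove the main theorem. 

Proof of Theorem 3. First of all we compute $\mu$, $\Delta$ and $\delta$, needed to apply Suen's inequality. The expected number of occurrences of the pattern $\sigma$ does not depend on $\sigma$ and can be computed as 

$$\mu = \sum_{i=0}^{n-m} \Pr(A_i) = \frac{n - m + 1}{m!} \le \frac{n}{m!}.$$

Recall that by the choice of the dependency graph $H$ (inspired by Lemma 7) two events $A_i$ and $A_j$ share no edge if $|i - j| \ge m$. Assume that $i < j$ and $j - i = m - k$; then by Lemma 10, 

$$\Pr(A_i \cap A_j) \le \frac{4^{m-k}}{\sqrt{\pi(m-k)} \cdot (2m-k)!} \quad \text{if } k \le m - 2,$$

and, since $\sigma$ is not monotone, by (4) and Lemma 8, 

$$\Pr(A_i \cap A_j) = 0 \quad \text{if } k = m - 1.$$

Hence, 

$$\sum_{j=i+1}^{i+m-1} \Pr(A_i \cap A_j) \le \sum_{k=1}^{m-2} \frac{4^{m-k}}{\sqrt{2\pi}\,(2m-k)!} \le \left( 1 + \frac{4}{m+3} + O(m^{-2}) \right) \frac{16}{\sqrt{2\pi}\,(m+2)!} \le \frac{17}{\sqrt{2\pi}\,(m+2)!},$$

for any $m$ large enough. 
Then, $\Delta$ can be bounded as 

$$\Delta = \sum_{i=0}^{n-m} \sum_{j=i+1}^{i+m-1} \Pr(A_i \cap A_j) \le \frac{17\, n}{\sqrt{2\pi}\,(m+2)!}. \qquad (6)$$

Since the degree of a vertex in the dependency graph $H$ is at most $2(m-1)$, 

$$\delta = \max_{0 \le i \le n-m} \sum_{j :\, ij \in E(H)} \Pr(A_j) = 2(m-1) \Pr(A_i) = \frac{2(m-1)}{m!} \le \frac{2}{(m-1)!}.$$

Using that $e^{2\delta} \le e^{4/(m-1)!} < 2$ if $m \ge 4$, Suen's inequality (1), together with (2), implies that for a large enough $m$ 

$$\rho_\sigma \le \exp\left( - \frac{1 - \frac{34}{\sqrt{2\pi}(m+2)(m+1)}}{m!} \right) \le 1 - \frac{\frac{1}{m!} - \frac{34}{\sqrt{2\pi}(m+2)(m+1)\,m!}}{1 + \frac{1}{m!}}$$

$$\le 1 - \frac{1}{m!} \left( 1 - O\left( \frac{1}{m!} \right) \right) + \frac{34}{\sqrt{2\pi}(m+2)(m+1)\,m!} \le 1 - \frac{1}{m!} + \frac{14}{m^2\, m!},$$

for any large enough $m$. We have used that $e^{-a} \le 1 - \frac{a}{1+a}$ for any $a > 0$. □ 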


3 A probabilistic proof of the CMP conjecture 

In this section we aim to provide an alternative proof of the CMP conjecture. We do it by obtaining a lower bound on $\rho_{(12 \dots m)}$ and showing that this bound is larger than the upper bound obtained in Theorem 3. A recent result of Elizalde and Noy gives an implicit expression for $\rho_{(12 \dots m)}$. 

Theorem 11 (Elizalde and Noy [8]). Let $z_0 = \rho_{(12 \dots m)}^{-1}$; then $z_0$ is the smallest solution of 

$$\sum_{i \ge 0} \frac{z^{mi}}{(mi)!} - \sum_{i \ge 0} \frac{z^{mi+1}}{(mi+1)!} = 0.$$
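
The implicit description in Theorem 11 lends itself to a quick numerical evaluation. The sketch below truncates the two series, locates the first sign change of the resulting function by scanning from $z = 0$, and refines the root by bisection. The truncation length, step size and function names are assumptions of this illustration, and it is intended for $m \ge 3$ only.

```python
def g(z, m, max_power=120):
    """Truncation of sum z^(mi)/(mi)! - sum z^(mi+1)/(mi+1)! up to z^max_power."""
    total, term = 0.0, 1.0           # term holds z**n / n!
    for n in range(max_power + 1):
        if n % m == 0:
            total += term
        elif n % m == 1:
            total -= term
        term *= z / (n + 1)
    return total

def rho_monotone(m, step=1e-3, tol=1e-12, z_max=10.0):
    """Approximate rho_(12...m) = 1/z_0, with z_0 the smallest positive root of g (m >= 3)."""
    lo = 0.0
    while g(lo + step, m) > 0:       # g(0) = 1 > 0, scan for the first sign change
        lo += step
        if lo > z_max:
            raise ValueError("no root found below z_max")
    hi = lo + step
    while hi - lo > tol:             # bisection on [lo, hi]
        mid = (lo + hi) / 2
        if g(mid, m) > 0:
            lo = mid
        else:
            hi = mid
    return 1.0 / hi
```

For $m = 3$ this gives approximately $0.82699$, which agrees with the closed form $3\sqrt{3}/(2\pi)$ for the pattern $(123)$, and for $m = 4$ approximately $0.9630$.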



From this last theorem we can extract an explicit lower bound on $\rho_{(12 \dots m)}$. 
Lemma 12. For any $m$ large enough, 

$$\rho_{(12 \dots m)} \ge 1 - \frac{1}{m!} + \frac{1}{m \cdot m!} + O\left( \frac{1}{m^2 \cdot m!} \right).$$
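Proof. Let $g(z)$ denote the left-hand side of the equation in Theorem 11. Observe that 

$$f(z) = 1 - z + \frac{z^m}{m!} - \frac{z^{m+1}}{(m+1)!} + \frac{z^{2m}}{(2m)!} \ge g(z), \qquad (7)$$

since $g(z)$ is an alternating sum whose terms are strictly decreasing. Since $g(0) = 1$ and $z_0$ is the smallest root of $g(z)$, we can conclude that $z_1$, the smallest root of $f(z)$, is at least $z_0$. Thus $\rho_{(12 \dots m)} \ge 1/z_1$ and it suffices to compute an upper bound on $z_1$. 

Write $z = (1 - \varepsilon)^{-1}$; then $z^{-2m} f(z) = 0$ becomes 

$$-(1 - \varepsilon)^{2m-1} \varepsilon + \frac{(1-\varepsilon)^{m-1}}{(m+1)!} \left( m - (m+1)\varepsilon \right) + \frac{1}{(2m)!} = 0.$$

Using $1 - nx \le (1 - x)^n \le 1 - nx + n^2 x^2$, 

$$0 \le -(1 - (2m-1)\varepsilon)\varepsilon + \frac{1 - (m-1)\varepsilon + (m-1)^2 \varepsilon^2}{(m+1)!} \left( m - (m+1)\varepsilon \right) + \frac{1}{(2m)!}$$

$$\le \left( 2m - 1 + \frac{(m-1)(m^2+1)}{(m+1)!} \right) \varepsilon^2 - \left( 1 + \frac{m^2+1}{(m+1)!} \right) \varepsilon + \left( \frac{m}{(m+1)!} + \frac{1}{(2m)!} \right).$$

Let $\varepsilon'$ be the smallest solution of the last expression set equal to zero. Then, $\rho_{(12 \dots m)} \ge 1 - \varepsilon'$. If $m$ is large enough we can get an asymptotic expression for $\varepsilon'$. Suppose that $b^2 \gg 4ac$; then the smallest solution of $ax^2 + bx + c = 0$ can be approximated by 

$$x = -\frac{c}{b} - \frac{a c^2}{b^3} + O\left( \frac{a^2 c^3}{b^5} \right). \qquad (8)$$

This leads to 

$$\varepsilon' = \frac{\frac{m}{(m+1)!} + \frac{1}{(2m)!}}{1 + \frac{m^2+1}{(m+1)!}} + O\left( \frac{m^3}{(m+1)!^2} \right) = \frac{m}{(m+1)!} + O\left( \frac{m^3}{(m+1)!^2} \right) = \frac{1}{m!} - \frac{1}{m \cdot m!} + O\left( \frac{1}{m^2 \cdot m!} \right),$$

where we have used $(1 + x)^{-1} = 1 - x + O(x^2)$ in the second and the last equalities. This proves the lemma. □ 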
The CMP conjecture follows as a straightforward corollary of Theorem 3 together with Lemma 12. 
Corollary 13. For any large enough $m$, the CMP conjecture is true. Moreover, for any $\sigma \in S_m \setminus \{(12 \dots m), (m \dots 21)\}$, 

$$\rho_{(12 \dots m)} - \rho_\sigma \ge \frac{1}{m \cdot m!} + O\left( \frac{1}{m^2 \cdot m!} \right).$$

Thus, this corollary not only shows that the CMP conjecture is true, but also provides a lower estimate of the minimum gap between $\rho_{(12 \dots m)}$ and $\rho_\sigma$, for any $\sigma \in S_m \setminus \{(12 \dots m), (m \dots 21)\}$. 

Note that the last corollary holds for any $m$ large enough. An upper bound without any assumption on $m$ can be derived from (1) and (6) in the proof of Theorem 3. Comparing this bound with the exact lower bound that follows from (7) in Lemma 12, one can check that the CMP conjecture holds for any $m \ge 5$. 

As we will see in the next section, it is also possible to provide a lower bound on $\rho_\sigma$ in terms of $m$ without using generating functions. Unlike the upper bound case, the lower bound just takes into account the number of dependencies among the events, but not the nature of these dependencies. It might be interesting to give a direct proof of the lower bound for the monotone pattern (Lemma 12) that does not rely upon any other result. For such a purpose it would be useful to understand the probabilities $\Pr\big( A_i \mid \bigcap_{j<i} \overline{A_j} \big)$ when $\sigma = (12 \dots m)$. 

4 A lower bound on $\rho_\sigma$ 

The setting used to give an upper bound on the number of permutations avoiding a given pattern can also be used to provide a lower bound on $\rho_\sigma$. Now we need a way to bound from below the probability that $X = 0$, and for such a purpose we will use the Lovász Local Lemma. 

Usually, the Local Lemma is used to show the existence of a certain configuration that does not satisfy any of the bad events in $\mathcal{A}$. However, in our problem it is trivial to see that for any pattern $\sigma \in S_m$ there exists at least one permutation of length $n$ that avoids $\sigma$. Nevertheless, the Local Lemma also provides an explicit lower bound on the probability that such a configuration exists, giving a lower estimate of the number of such configurations. We will use it to derive a lower bound on the number of permutations of length $n$ that avoid $\sigma$. 

The following version of the Local Lemma was proposed by Peres and Schlag in [14] and it is convenient for our approach. 

Lemma 14 (One-sided Local Lemma). Let $A_1, \dots, A_N$ be events and let $x_1, x_2, \dots, x_N$ be a sequence of numbers in $(0, 1)$. Assume that, for every $i \in [N]$, there is an integer $0 < m(i) \le i$ such that 

$$\Pr\Big( A_i \,\Big|\, \bigcap_{j < m(i)} \overline{A_j} \Big) \le x_i \prod_{j = m(i)}^{i-1} (1 - x_j). \qquad (9)$$

Then, 

$$\Pr\Big( \bigcap_{i=1}^{N} \overline{A_i} \Big) \ge \prod_{i=1}^{N} (1 - x_i). \qquad (10)$$
To use the Local Lemma a dependency graph on the set of events must be set. In the case of the one-sided version, the graph is defined implicitly in (9) as the directed circulant graph with out-degree $i - m(i)$. Thus, the same dependency graph used for Suen's inequality is also valid to apply the Local Lemma. 

Next, we give the proof of the lower bound on $\rho_\sigma$. 

Proof of Theorem 4. Let $\mathcal{A} = \{A_0, \dots, A_{n-m}\}$ and $X$ be defined as in Section 2. Set $m(i) = i - m + 1$. Using Lemma 7 with $S = \{i\}$ and $T = \{0, 1, \dots, i - m\}$, 

$$\Pr\Big( A_i \,\Big|\, \bigcap_{j \le i-m} \overline{A_j} \Big) = \Pr(A_i). \qquad (11)$$

Since all the events are symmetric we set $x_i = x$, for any $0 \le i \le n - m$. Then, condition (9) becomes 

$$\Pr(A_i) \le x (1 - x)^{m-1}. \qquad (12)$$

Recall that $\Pr(A_i) = \frac{1}{m!}$. Thus, the previous equation implies that $x \ge \frac{1}{m!}$. Besides, we are interested in keeping $x$ as small as possible, because of (10). Let us write $x = \frac{e^{f(m)}}{m!}$ for some positive function $f(m)$. Hence, using $(1 - x) \le e^{-x}$, condition (12) implies 

$$\frac{e^{f(m)} (m-1)}{m!} \le f(m),$$

which also implies $f(m) \ge \frac{m-1}{m!}$, since $f(m) > 0$. By setting $x = \frac{e^{(m-1)/m!}}{m!}$, condition (9) is satisfied and the Local Lemma can be applied. In particular, we obtain the following lower bound on the probability that $X = 0$, 

$$\Pr(X = 0) = \Pr\Big( \bigcap_{i=0}^{n-m} \overline{A_i} \Big) \ge \left( 1 - \frac{e^{(m-1)/m!}}{m!} \right)^{n-m+1},$$

and using (2), 

$$\rho_\sigma \ge 1 - \frac{e^{(m-1)/m!}}{m!} = 1 - \frac{1}{m!} - O\left( \frac{m-1}{(m!)^2} \right).$$

□ 
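
As a numerical companion to this proof, one can solve condition (12) directly instead of using a closed-form choice of $x$. The sketch below is an illustration with hypothetical names and is not the paper's argument. It finds by bisection the smallest $x \in (0, 1/m]$ with $x(1-x)^{m-1} \ge 1/m!$, relying on the fact that $x(1-x)^{m-1}$ is increasing on $[0, 1/m]$. By (10) and (2), any such $x$ yields $\rho_\sigma \ge 1 - x$.

```python
from math import factorial

def smallest_admissible_x(m, tol=1e-15):
    """Smallest x in (0, 1/m] with x * (1 - x)**(m - 1) >= 1/m!, or None if there is none."""
    target = 1.0 / factorial(m)

    def h(x):
        return x * (1.0 - x) ** (m - 1)

    lo, hi = 0.0, 1.0 / m            # h is increasing on [0, 1/m] and maximal at 1/m
    if h(hi) < target:
        return None                  # condition (12) cannot be met with equal x_i
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if h(mid) >= target:
            hi = mid
        else:
            lo = mid
    return hi

def local_lemma_lower_bound(m):
    """Lower bound 1 - x on rho_sigma from the symmetric one-sided Local Lemma."""
    x = smallest_admissible_x(m)
    return None if x is None else 1.0 - x
```

For $m = 3$ the function returns `None`, since the maximum of $x(1-x)^2$ is $4/27 < 1/6$. For $m \ge 4$ a solution exists and stays within $1/m! + 2(m-1)/(m!)^2$ in the range tested below, consistent with the order of the error term in Theorem 4.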



The lower bound given by Theorem 4 is tight. This can be shown using a result of Elizalde in [6], where the author proved that the least avoided pattern is $(12 \dots m-2,\, m,\, m-1)$. The author also gives an implicit lower bound on $z_0 = \rho^{-1}_{(12 \dots m-2,\, m,\, m-1)}$ as the smallest root of 

$$f(z) = 1 - z + \frac{z^m}{m!} - m \frac{z^{2m-1}}{(2m-1)!}.$$

An explicit upper bound can be derived from the previous equation, as in Lemma 12: 

$$\rho_{(12 \dots m-2,\, m,\, m-1)} \le 1 - \frac{1}{m!} - \Omega\left( \frac{m-1}{(m!)^2} \right).$$

In order to prove Conjecture 2, one could try to use the same strategy we have used for the CMP conjecture. First, determine the subset of patterns $\sigma$ such that $\alpha_n(\sigma) = \alpha_n(12 \dots m-2,\, m,\, m-1)$ and then improve the lower bound for the patterns which are not in this subset. However, this approach cannot succeed for Conjecture 2. Notice that, unlike in the proof of the upper bound in Theorem 3, no assumption on the properties of the pattern has been used in the proof of the lower bound. Unfortunately, the Local Lemma cannot distinguish the different nature of the dependencies among events. Thus, no better lower bound can be achieved by restricting to a smaller subset of patterns. This is also the main obstacle to proving Lemma 12 using our approach. 

In the next section we will improve the upper bound of Theorem 3 for large subsets of patterns. 



5 The typical behavior of patterns 

The results of the previous sections provide tight upper and lower bounds on $\rho_\sigma$ for any $\sigma \in S_m$. In this section we want to show that, for a typical pattern, $\rho_\sigma$ lies much closer to the lower bound than to the upper bound. That is, the number of $\sigma$-avoiding permutations of length $n$, when $\sigma \in S_m$ is chosen uniformly at random, is closer to the number of permutations that avoid $(12 \dots m-2,\, m,\, m-1)$ than to the number of permutations that avoid $(12 \dots m)$. 

Define $\mathcal{N}_k \subseteq S_m$ as the set of patterns of length $m$ that overlap at position $k$. The following lemma bounds from above the size of these sets. 

Lemma 15. Let $\sigma \in S_m$ be chosen uniformly at random, then 

1. $\Pr(\sigma \in \mathcal{N}_k) = \frac{1}{k!}$ if $2 \le 2k \le m$. 

2. $\Pr(\sigma \in \mathcal{N}_k) \le 2^{-m/2+1}$ if $m < 2k \le 2(m - 1)$. 

Proof of 1. Choose $\sigma \in S_m$ uniformly at random. Recall that the condition for $\sigma \in \mathcal{N}_k$ is that $\tau^1 = \mathrm{st}(\sigma_1, \dots, \sigma_k)$ and $\tau^2 = \mathrm{st}(\sigma_{m-k+1}, \dots, \sigma_m)$ are equal. If $2k \le m$, then $\tau^1$ and $\tau^2$ are independent by Lemma 7 and uniformly distributed in $S_k$. For any $\tau, \tau' \in S_k$, 

$$\Pr(\tau^1 = \tau \mid \tau^2 = \tau') = \Pr(\tau^1 = \tau).$$

Thus, we can compute the exact probability of being in $\mathcal{N}_k$, 

$$\Pr(\sigma \in \mathcal{N}_k) = \Pr(\tau^1 = \tau^2) = \sum_{\tau \in S_k} \Pr(\tau^1 = \tau \wedge \tau^2 = \tau) = k! \Pr(\tau^1 = \tau)^2 = \frac{1}{k!}.$$

□ 
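
The first part of the lemma can be confirmed exactly for small $m$ by enumerating all patterns. The following sketch, whose function name is an illustrative choice, returns the fraction of patterns of length $m$ with an overlap at $k$ as an exact rational number.

```python
from fractions import Fraction
from itertools import permutations

def st(seq):
    """Standardization of a sequence of distinct integers."""
    ordered = sorted(seq)
    return tuple(ordered.index(v) + 1 for v in seq)

def overlap_probability(m, k):
    """Exact probability that a uniformly random pattern of length m overlaps at k."""
    patterns = list(permutations(range(1, m + 1)))
    hits = sum(1 for s in patterns if st(s[:k]) == st(s[m - k:]))
    return Fraction(hits, len(patterns))
```

For $m = 6$ the values at $k = 1, 2, 3$ are $1$, $1/2$ and $1/6$, matching $1/k!$. For $2k > m$ the two windows share positions and the probability is no longer $1/k!$ in general, for example `overlap_probability(4, 3)` equals $2/24$, the two monotone patterns of Lemma 8.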
Proof of 2. Choose $\sigma \in S_m$ uniformly at random. Partition the pattern $\sigma$ into parts of length $m - k$ by defining $\tau_i = \mathrm{st}(\sigma_{(m-k)(i-1)+1}, \dots, \sigma_{(m-k)i})$ for any $1 \le i \le \lfloor \frac{m}{m-k} \rfloor$. Observe that, in order to have an overlap at $k$, we must have $\tau_1 = \tau_i$ for any $i > 1$. This condition is clearly necessary but not sufficient for a pattern to overlap at $k$. 

Since $2k > m$, we have at least $\lfloor \frac{m}{m-k} \rfloor \ge 2$ parts. By the choice of $\sigma$, the permutations $\tau_i$ are uniformly distributed in $S_{m-k}$, and by Lemma 7, they are mutually independent. This implies 

$$\Pr(\sigma \in \mathcal{N}_k) \le \prod_{i \ge 2} \Pr(\tau_i = \tau_1) = \left( \frac{1}{(m-k)!} \right)^{\lfloor \frac{m}{m-k} \rfloor - 1} \le 2^{-m/2+1}$$

for any $k \le m - 2$. If $k = m - 1$, $\mathcal{N}_{m-1}$ is the set of patterns with an overlap at $m - 1$ and the upper bound is directly implied by Lemma 8. □ 

Unlike in the case when $2k \le m$, where we can determine exactly the size of $\mathcal{N}_k$, a non-tight upper bound is given when $2k > m$. Observe that the sets $\mathcal{N}_k$ cover all of $S_m$ but they are not a partition of it. For instance, monotone patterns belong to all such sets, since they overlap at any possible position. However, we conjecture that 

|A4|<i, 

for every 1 < k < m — 1. 

We use the previous lemma to give a lower bound on the size of $\mathcal{M}_k$, the set of patterns that have no overlap at any position larger than $k$. 

Lemma 16. Let $\sigma \in S_m$ be chosen uniformly at random. Then, for any $1 \le k \le m/2$, 

$$\Pr(\sigma \in \mathcal{M}_k) \ge 1 - \frac{2}{(k+1)!} - m 2^{-m/2}.$$

Proof. Observe that we can bound from below the size of $\mathcal{M}_k$ using the sets $\mathcal{N}_\ell$, 

$$|\mathcal{M}_k| = \Big| S_m \setminus \bigcup_{\ell=k+1}^{m-1} \mathcal{N}_\ell \Big| \ge m! - \sum_{\ell=k+1}^{m-1} |\mathcal{N}_\ell|. \qquad (13)$$

By Lemma 15 and the relation in (13), for any $k$ such that $2k \le m$, 

$$\Pr(\sigma \in \mathcal{M}_k) \ge 1 - \sum_{\ell=k+1}^{m-1} \Pr(\sigma \in \mathcal{N}_\ell) \ge 1 - \sum_{\ell=k+1}^{m/2} \frac{1}{\ell!} - m 2^{-m/2} \ge 1 - \frac{2}{(k+1)!} - m 2^{-m/2}.$$

□ 
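
The union bound (13) and the sizes of the sets $\mathcal{M}_k$ can be examined directly for small $m$. The sketch below uses illustrative function names; it counts $\mathcal{M}_k$ and $\mathcal{N}_\ell$ by enumeration.

```python
from itertools import permutations

def st(seq):
    """Standardization of a sequence of distinct integers."""
    ordered = sorted(seq)
    return tuple(ordered.index(v) + 1 for v in seq)

def overlaps_at(sigma, k):
    """True if sigma has an overlap at k."""
    return st(sigma[:k]) == st(sigma[len(sigma) - k:])

def count_N(m, ell):
    """Number of patterns of length m with an overlap at ell."""
    return sum(1 for s in permutations(range(1, m + 1)) if overlaps_at(s, ell))

def count_M(m, k):
    """Number of patterns of length m with no overlap at any position larger than k."""
    return sum(1 for s in permutations(range(1, m + 1))
               if not any(overlaps_at(s, ell) for ell in range(k + 1, m)))
```

For $m = 3$ and $m = 4$ the numbers of non-overlapping patterns, `count_M(m, 1)`, are 4 and 12 respectively. Comparing `count_M(m, k)` with `factorial(m) - sum(count_N(m, ell) for ell in range(k + 1, m))` shows how much the union bound in (13) loses for a given $m$ and $k$.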
Recall that $\mathcal{M}_1$ corresponds to the set of non-overlapping patterns. The proof of Lemma 16 implies that $|\mathcal{M}_1| \ge (3 - e)\, m!$. This bound can be refined. Indeed, Bóna [4] showed that 

$$0.364098149 < \frac{|\mathcal{M}_1|}{m!} < 0.3640992743.$$

The previous bound on $|\mathcal{M}_k|$ is clearly not sharp. A better estimate of the size of $\mathcal{N}_k$ when $2k > m$ would help to understand the distribution of $\rho_\sigma$ when $\sigma \in S_m$ is chosen uniformly at random. 

The next lemma shows that a better bound on $\Delta$ can be given if the pattern does not have a large overlap. 
Lemma 17. For any $\sigma \in \mathcal{M}_k$, 

$$\Delta \le \frac{4^{m-k}}{(2m-k)!}\, n.$$

Proof. Since $\sigma \in \mathcal{M}_k$ we have $\Pr(A_i \cap A_{i+m-j}) = 0$ for any $j$ such that $k < j \le m - 1$. Using Lemma 10, 

$$\Delta \le \sum_{i=0}^{n-m} \sum_{j=1}^{k} \Pr(A_i \cap A_{i+m-j}) \le n \sum_{j=1}^{k} \frac{4^{m-j}}{\sqrt{\pi(m-j)}\,(2m-j)!} \le \frac{4^{m-k}}{(2m-k)!}\, n.$$

□ 

Proof of Theorem 5. Assume that $\sigma \in \mathcal{M}_k$. It follows from Lemma 17 that 

< " ~7 = — : < 



H~{2m-k)\ [ 2m ~ k ) {m - k)\ ~ (m-k)\ ' 

Since $e^{2\delta} \le e^{4/(m-1)!} < 2$ for any $m \ge 4$, using (1) we can derive the following upper bound, 

$$\Pr(X = 0) \le \exp\left( - \left( 1 - O\left( \frac{1}{(m-k)!} \right) \right) \frac{n}{m!} \right).$$

From (2), 

$$\rho_\sigma = \lim_{n \to \infty} \Pr(X = 0)^{1/n} \le 1 - \frac{1}{m!} + O\left( \frac{1}{(m-k)!\, m!} \right),$$

where we have used $e^{-a} \le 1 - \frac{a}{1+a}$. 

This upper bound holds when $\sigma \in \mathcal{M}_k$, and this happens with probability at least $1 - \frac{2}{(k+1)!} - m 2^{-m/2}$ when $\sigma$ is chosen uniformly at random, by Lemma 16. □ 

Acknowledgement. The author is grateful to Marc Noy and Oriol Serra for helpful discussions. 